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ABSTRACT 

Determination of microlensing parameters in the gravitationally lensed quasar 
Q2237+0305 from the statistics of high magnification events will require monitoring 
for more than 100 years (Wambsganss, Paczynski & Schneider 1990). However we 
show that the effective transverse velocity of the lensing galaxy can be determined 
on a more realistic time-scale through consideration of the distribution of light-curve 
derivatives. The 10 years of existing monitoring data for Q2237+0305 are analysed. 
These data display strong evidence for microlensing that is not associated with a 
high magnification event. An upper limit of v t < 500 fcmsec -1 is obtained for the 
galactic transverse velocity which is smaller than previously assumed values. The 
analysis suggests that the observed microlensing variation may be predominantly due 
to stellar proper motions. The statistical significance of the results obtained from 
our method will be increased by the addition of data points from current and future 
monitoring campaigns. However reduced photometric errors will be more valuable than 
an increased sampling rate. 

Key words: gravitational lensing - microlensing - quasars - numerical methods. 



1 INTRODUCTION 

Q2237+0305 is a quasar at redshift 2=1.695 that is gravi- 
tationally lensed by a foreground spiral galaxy (.2=0.0394). 
Due to the unusually close alignment, the quasar is sepa- 
rated into 4 resolved images having separations of ~ 1". 
These images are seen through the bulge of the lensing 
galaxy, and so Q2237+0305 is an excellent candidate for the 
observation of gravitational microlensing. Since the discov- 
ery of microlensing in Q2237+0305 (Irwin et al. 1989, Cor- 
rigan et al. 1991) various numerical techniques have been 
used to model the observed continuum flux variations (eg. 
Wambsganss, Paczynski & Katz 1989; Witt, Kayser & Refs- 
dal 1993). The models demonstrate that it will be possible 
to use the statistics of high magnification events (HMEs) 
from monitoring over a large period of time (> 100 years) 
(Wambsganss, Paczynski & Schneider 1990) to determine 
properties of the lens such as the the stellar mass function 
and the percentage of mass in compact objects. 

The most important unknown parameter in microlens- 
ing models is the galactic transverse velocity. In the case of 
Q2237+0305 most previous analyses have assumed a value 
of v t — 600 km sec -1 . The uncertainty in the value of trans- 
verse velocity makes the analysis of monitoring data difficult 
because statistics of HMEs are proportional to its value. In 
addition, since the characteristic length or Einstein radius 



is proportional to the square root of the mass, attempts to 
obtain information on the typical mass of microlenses are de- 
pendent on the square of v t . Witt & Mao (1994) have found 
a height-gradient correlation during model HMEs. This cor- 
relation is dependent on the source size assumed, however 
they have interpreted the observed candidate HMEs in light 
of this correlation and find a transverse velocity that is lower 
than 600 km sec~ . The rate of rise or fall during the initial 
or final stages of the HME is dependent on the angle between 
the trajectory and the relevant caustic, the source size and 
profile, as well as the transverse velocity. At least part of this 
degeneracy will need to be broken in order for a successful 
analysis of any HME to be made. 

The contribution to microlensing of proper motions of 
microlenses has not been included in the aforementioned 
analyses. This neglect is due to the computational difficul- 
ties involved in the calculation of light curves from models 
which include stellar proper motions. The assumption of a 
static lens configuration requires an effective galactic trans- 
verse velocity that is an order of magnitude or so higher than 
the typical random motion. This is not a good assumption 
however if the unknown transverse velocity is of a similar 
magnitude or smaller than the stellar velocity dispersion. 

The effect of proper motions on microlensing has been 
discussed by several authors (Schramm et al. 1992; Kundic 
& Wambsganss 1993; Kundic, Witt & Chang 1993; Wamb- 
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sganss & Kundic 1995). Kundic & Wambsganss 1993 and 
Kundic, Witt & Chang 1993 have discussed the manner 
in which stellar proper motions increase the frequency of 
HMEs. They agree quantitatively on the manifestation of 
this effect in the case of image A of Q2237+0305, but not 
on how it is manifested in general. Wyithe, Webster & 
Turner (1999) considered the effect of proper motions in 
terms of the distribution of resulting light curve derivatives. 
An analysis of the expected contribution of proper motions 
to microlensing in Q2237+0305 is presented as part of the 
current work, and is based on formalism developed in that 
paper. 

In addition to the difficulties presented by the relatively 
rare occurrence of HMEs, the analysis of current monitoring 
data is hampered by the fact that the sparse sampling rate 
may have lead to HMEs being missed altogether. However, 
comparatively speaking the monitoring data provides a good 
record of the longer duration low level fluctuations which do 
not involve new critical images or (very rarely) a cusp. This 
shift in focus vastly increases the number of data points that 
can be analysed to determine the physical properties of the 
system. 

In this paper we investigate the distribution of the rate 
of change of magnification in the light-curve, particularly 
in regions of the light-curve where the size of the fluctua- 
tion is relatively small compared to that during an HME. 
The advantages of such an approach are numerous. Firstly, 
far from a caustic where the rates of fluctuation are small, 
the light-curve is approximately independent of source size 
and intensity profile. Secondly, a given section of light-curve 
has a typical magnification that is dependent on the local 
region of the magnification pattern. Due to the clustered 
nature of the caustic network in the presence of a shear 
(as for Q2237+0305), the average magnification in a re- 
gion of a light-curve can differ from the theoretical mean 
{fJ-av = 1/|(1 — k) 2 — 7 2 j) by a significant amount for a pe- 
riod of up to a few decades. A plot of the derivative is not 
expected to exhibit clustering of this type as it is approxi- 
mately independent of the number of slowly varying pairs of 
semi-critical images associated with a source having crossed 
caustics. 

This paper is presented in seven parts. Section |^ shows 
how the transverse velocity can be estimated from the dis- 
tribution of the rates of change of the difference between 
the image magnitudes. In sections ^| and ^, the effects of 
observational and physical parameters on the measurement 
of transverse velocity are discussed. Section |H| describes the 
effect of stellar proper motions on the derivative distribution 
and in section ^ the theory is applied to the monitoring data 
of Irwin et al. (1989), Corrigan et al. (1991) and 0stensen 
et al. (1995). The results obtained are presented in section 





2 MEASURING THE EFFECTIVE 
TRANSVERSE VELOCITY 

The observed microlensing rate is produced by the combi- 
nation of microlens proper motions and a galactic trans- 
verse velocity. We define the effective transverse velocity as 
being that which in combination with a static microlens- 
ing model, produces a microlensing rate equal to that of 



the observed light curve. The effective transverse velocity is 
therefore larger than the physical transverse velocity. This 
section describes the method used to estimate the effective 
transverse velocity from monitoring data. 



2.1 Microlensing models 

Throughout the paper, standard notation for gravitational 
lensing is used. The Einstein radius of a 1M© star in the 
source plane is denoted by r\ . The normalised shear due to 
external mass is denoted by 7, and the convergence or optical 
depth by k. The normalised lens equation for a field of point 
masses with an applied shear in terms of these quantities is 

V 7 N stara 

Here x and y are the normalised image and source position 
respectively, and the xl are the normalised positions of the 
point masses, k* and k c are the optical depth in stars and 
smoothly distributed matter respectively. Where required, a 
cosmology having Q. = 1 with Ho = 75 km sec -1 is assumed. 

To construct light-curves that result from microlens- 
ing of a point source we use the contouring method (Lewis, 
Miralda-Escude, Richardson & Wambsganss 1993; Witt 
1993). Where a light curve is required for an extended 
source, we use the 2 dimensional extension of the contour- 
ing method (Wyithe & Webster 1999) or an equivalent 1- 
d approximation (eg. Witt & Mao 1994). The microlensing 
models consist of a homogeneous disc of point masses. Katz, 
Balbus & Paczynski (1986) obtained an expression describ- 
ing the region of the lens plane in which image solutions need 
to be found to ensure that ~ 99% of the total macro-image 
flux is recovered. This can be termed the flux collecting re- 
gion. The union of flux collecting regions that correspond 
to each point on the source line is termed the shooting re- 
gion. The dimensions of the shooting region are determined 
in the standard manner (eg Lewis & Irwin 1995; Wyithe & 
Webster 1999), and the disc of point masses chosen to have 
a radius that is 1.2 x that required to cover this region. The 
numbers of stars used in the microlensing models under the 
different assumptions for the mass distribution used in this 
paper are given in table 0. 



2.2 The Distribution of Derivatives 

The basic tool of our analysis is the distribution of light 
curve derivatives in a given simulation, collection of simu- 
lations or set of observational monitoring data. The trans- 
verse velocity is a scaling factor in the light curve derivative. 
Figure |l| shows the cumulative distribution functions of light 
curve derivatives for models of image A of Q2237+0305 that 
have a positive shear (« = 0.36, 7 = +0.4) and transverse 
velocities ranging between 100 km sec' 1 and 2700 km sec -1 . 
The cumulative histogram has a smaller value if the trans- 
verse velocity is higher because at higher transverse veloc- 
ities a larger proportion of points have larger derivatives. 
Thus by considering the shape of the cumulative distribu- 
tion of light curve derivatives, information on the transverse 
velocity can be obtained. 
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Liqht Curve Gradient (Maqnitudes per year) 
Figure 1. The cumulative histograms calculated using the trans- 
verse speeds 100, 300, 900, and 2700km sec -1 (Thinner lines de- 
note smaller velocities). The point masses are distributed with a 
Salpeter mass function p(m)dm oc m — 2,35 dm and O.IMq < m < 
l.OMg. The light curves were computed for a point source. The 
optical depth was re = 0.36, and the shear 7 = 0.4. 



2.3 General Considerations 

As we show, treating the transverse velocity as a variable 
allows us to obtain limits on its value. When analysing data 
from a gravitationally lensed quasar such as Q2237+0305, 
we are interested in the independent fluctuations in the in- 
dividual images. It is advantageous to look at the rates of 
change of the independent differences between the magni- 
tudes of the images. There are two reasons for this. Firstly 
any intrinsic fluctuation that is present in the source quasar 
will be removed by this procedure. We note however, that 
in general, the time delay must be taken into account. For 
Q2237+0305 the delays are less than one day (Schneider et 
al. 1988) and so are much shorter than the intrinsic source 
variability timescale. The second advantage is that when 
constructing a light curve from observations, the error in the 
photometry comprises two components (Irwin et al. 1989). 
The first of these is the random error due to profile fitting 
for the brightness at the position of each quasar image. The 
second error is approximately systematic across all images 
and is due to errors produced in the procedure of subtract- 
ing a suitably scaled model galaxy from the image. If the 
magnitudes in two different observations of a single image 
are being compared, then the systematic errors need to be 
taken into account. However the systematic component of 
the error is unimportant when comparing the difference be- 
tween the magnitude of two different images at two different 
times. In the case of Q2237+0305 the histogram is composed 
of fluctuations in six differences between the image magni- 
tudes (A - B, A - C, A - D, B - C,B - D, C - D). While 
these 6 differences still only contain 4 degrees of freedom, 
all 6 are needed so that the level of fluctuation measured is 
not biased towards any one image. 

It is assumed that the macro models (eg. Schmidt, Web- 
ster & Lewis 1998) for the lensing galaxy in the Q2237+0305 
system correctly describe the shear and optical depth pa- 
rameters at the position of each image. The microlensing 



model of Q2237+0305 includes the following unknowns as 
input parameters: 

i) Transverse Velocity- This is thought to be ~ 
600 fcmsec -1 , a quantity obtained through probabilistic ar- 
guments in combination with general cluster dynamics (eg. 
Mould et al. 1993). Witt & Mao (1994) find a lower value, 
however their measurement is dependent on source size. 

ii) Trajectory direction- The shear term at each image 
breaks the circular symmetry so that the direction of the 
transverse component of the galactic velocity will influ- 
ence microlensing statistics. This direction is completely un- 
known. 

Hi) Source Size- This is thought to be < 2 x 10 15 cm (eg. 
Wambsganss, Paczynski & Schneider 1990; Rauch & Bland- 
ford 1991), however this measurement is based on one poorly 
sampled HME. The microlensing value is corroborated by 
the length scale, ct ~ 10 14 th rs cm, associated with X-ray 
and optical changes in the continuum emission. A source size 
of ~ 10 15 cm is also consistent with the typical scalesize of 
a continuum emitting accretion disc about a super-massive 
black hole (Rees 1984). 

iv) Source Intensity Profile- This is dependent on the model 
assumed for the continuum source. It is only an important 
factor during a HME (Wyithe & Webster 1999). 

v) Microlens Properties- The mass limits and mass function 
of the stars and compact objects, as well as the size of the 
smooth matter component of the optical depth in the lensing 
galactic bulge must be assumed based on information about 
these quantities in our own galaxy. 

2.4 The Kolmogorov Smirnov Statistic 

The comparison between monitoring data of Q2237+0305 
and ensembles of simulations is made through indirect use of 
the Kolmogorov- Smirnov (KS) statistic. For two cumulative 
distributions F and G, the KS statistic is defined: 

D = max(\F — G\). (2) 

This statistic is equivalent to the probability of the null hy- 
pothesis that F and G are different. This equivalence relies 
on the data points used to build the distributions F and G 
being independent. Unfortunately when the histograms are 
comprised of simulated or observed microlensing data, the 
component points are not independent because of the com- 
mon region of caustic network from which they are drawn. 
The KS statistic is useful however for comparing the data 
to a range of simulations. 

For a given transverse velocity, the time averaged distri- 
bution is produced from a large collection of 10 year sampled 
simulations. The KS statistic is then found between each of 
the simulated histograms and the time averaged histogram. 
This produces a cumulative distribution of KS statistics 
P(D S i m (vt)) for that transverse velocity v t which describes 
the spread in simulations due to the combination of caus- 
tic clustering and small monitoring length. The KS statistic 
can then be found between the time averaged histogram 
and the observational histogram (D b s (vt))- The suitability 
of the transverse velocity for describing the data set is then 
discussed in terms of the relationship of D b s (vt) to the sim- 
ulated distribution P(D a i m (vt))- At a transverse velocity vt, 
P(D bs(vt)) finds the fraction of simulations that are more 
consistent with the mean than the observations . 
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Figure 2. The cumulative histograms of KS differences between 
time averaged and 500 sampled simulations for different sampling 
rates and monitoring periods. The thin lines correspond to a fO 
year monitoring period, and the thick line to a 20 year monitoring 
period. The light lines correspond to a sampling rate of 1 point per 
f 5 days, and the dark lines to f point per 30 days. Both sampling 
rates are for 6 months per year. The models were calculated from 
sampled difference light curves of models of Q2237+0305 for a 
point source which contained no simulated observational errors. 
The transverse velocity was 400 km sec~ . The point masses are 
distributed with a Salpeter mass function p(m)dm ex. m — 2 35 dm 
and 0.1M@ < m < I.OMq . The optical depth and shear values 
were those of Q2237+0305, with 7a>7b > and jc-i Id < 0. 

To compute the upper limit of the transverse velocity 
it is natural to modify the KS statistic to find whether the 
distribution has many more high derivatives than the mean. 
Such a measure is: 

D v = max(F -G), (3) 

where F is the time averaged histogram. Conversely, a lower 
limit is obtained through the statistic 

Dl = max(G — F). (4) 

The normal KS statistic (Eqn ^) can be used to find the 
most likely transverse velocity, though this approach is much 
more susceptible to accepting a false hypothesis than to than 
rejecting a true one, and so the expected value must be 
treated with more caution than the limits. 



3 THE EFFECT OF OBSERVATIONAL 
PARAMETERS 

3.1 The Effect of Sampling Rates and Monitoring 
Periods 

When a histogram is produced from a sampled light curve, 
its shape is dependent on the sampling rate. In the case of a 
sparsely sampled curve the shape is also dependent on the 
type of numerical derivative computed. A two point deriva- 
tive will optimally sample noise in the data, while a higher 
order calculation will smear out real fluctuations due to the 
low sampling rates. A simulated observational error is built 
into the model. The greatest difference between histograms 
calculated for noise only and those calculated for a light- 
curve plus noise is obtained using the three point deriva- 
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Light Curve Gradient (Magnitudes per year) 
Figure 3. The cumulative histograms of derivatives calculated 
from sampled difference light curves of models of Q2237+0305 
for a point source which contain simulated observational errors. 
The thicker lines represent the larger errors. Three la levels were 
used, AM = ±0, ±0.01, ±0.04. The sampling rate was 1 point 
per 15 days for 6 months per year, and the transverse velocity 
was 400 fcm sec -1 . The microlensing model was the same as that 
described for Figure ^. 

tive. We therefore choose to use a three point derivative for 
the calculating the derivative histograms of sampled light 
curves. 

Figure ^ displays examples of the distributions 
P(D S i mu i) of KS values between the simulations and time 
averaged distribution. The figure shows histograms corre- 
sponding to sampling rates of 1 point per 15 and 30 days, as 
well as monitoring periods of 10 and 20 years. The size of the 
determined range of transverse velocities is equivalent to the 
fraction of large KS differences found in the simulations. We 
find that the different sampling rates considered have very 
little effect on the size of the determined range of transverse 
velocities. In contrast, the length of the monitoring period 
has a significant effect on how good a representation a given 
simulation is of the time average. The longer monitoring pe- 
riod has a higher fraction of simulations with smaller KS 
differences, which indicates that at the low transverse ve- 
locities being considered here, the sampled part of the light 
curve has approximately the same length or is shorter than 
a typical cluster of caustics. This is expected since in the 
limit of large monitoring periods, the distribution of KS dif- 
ferences is a step function at zero. For the present problem 
it is therefore the length of the monitoring period rather 
than the time between observations that is more important 
in obtaining an accurate statistical measure. 

3.2 The Effect of Observational error 

Figure ^ displays the effect on the derivative histogram of 
the inclusion of observational error into the models. Obser- 
vational error is simulated as an additional random compo- 
nent in the model light curve that is distributed according to 
a Gaussian. From figure ^| we see that the inclusion of errors 
has the effect of lowering the proportion of small derivatives 
in the light curves. This effect is the manifestation of the ob- 
servational noise, and is due to the measurement uncertain- 
ties introducing independent artificial fluctuation into the 
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Source Position (Einstein Radii) 

Figure 4. The microlensed light curves for a point source (light 
line) and 0. lrjo source. The extended source light curve was com- 
puted using the 2D contouring method, and has a limb darkened 
profile. The point masses are distributed with a Salpeter mass 
function p(m)dm oc rn~ 2 - 35 dm and O.IMq < m < I.OMq. The 
light curves were computed for a point source. The optical depth 
was K = 0.36, and the shear 7 = +0.4. 

fiat parts of the light curves. In cases of a large observational 
error, the dearth of low derivatives result in a lower estimate 
of transverse velocity. Where the errors are extremely large, 
the microlensing signal will therefore be lost in the noise. 
This would be indicated by a measure of transverse velocity 
which is not significantly above zero. This suggests that a 
reasonable understanding of the size and nature of observa- 
tional errors is required for a successful analysis to be made. 
Indeed, for a determination of transverse velocity, it is more 
important to have smaller errors than higher temporal reso- 
lution. The inclusion of errors has a negligible effect on the 
size of the measured range of transverse velocity. 



4 THE EFFECT OF VARYING SOURCE SIZE 

The measurement of transverse velocity that we make is 
systematically affected by the assumptions made for the 
model parameters of source size, trajectory direction, and 
microlens mass function. The effect of the unknown source 
size on our measurement is discussed below. The effects of 
the trajectory direction, and microlens mass function are 
deferred to section ^ where they are discussed in the con- 
text of measurements made from the monitoring data on 
Q2237+0305. 

The detail of a light curve, particularly during HMEs 
is dependent on both the size and intensity profile of the 
source. Figure ^| shows model light curves for image A of 
Q2237+0305 ("/a ~ +0.4) in both the cases of a point source 
and a 0.1r) o diameter source with a limb darkened profile. It 
is clear from this diagram that a point source is subject to 
much larger fluctuations during HMEs than a larger source. 
The smaller source size however also means that the source 
spends less time in contact with a caustic so that HME scale 
derivatives occur less often. This results in a relative excess 
of the largest light curve derivatives in the case of the point 
and smaller sources, which is offset by an increase in the 
number of small light curve derivatives. 



The light curve of an extended source can be produced 
through an appropriately weighted integration along the 
point source light curve (eg. Witt & Mao 1994). Such an 
approximation obtains event amplitudes which are close to 
their correct values, although the details of the events are 
not accurate. The approximation assumes that the caustic 
is straight and perpendicular to the source trajectory. In ad- 
dition it assumes that their are no other caustics within a 
source radius of the source line on either side. Neither of 
these assumptions will be true in general. We use the ap- 
proximation in the current work as we are not concerned 
with the detail of HMEs due to the low sampling rate. 

When a sampling rate is applied to the light curve, a 
smaller source may experience fewer large fluctuations. De- 
pending on the set of observations, a higher transverse ve- 
locity may therefore be required to produce a given level of 
fluctuation and so a correspondingly smaller transverse ve- 
locity measured where a larger source size is assumed. This 
is in contradiction to the theoretical histogram. 



5 MICROLENSING DUE TO STELLAR 
PROPER MOTIONS 

The above models include stellar positions that remain 
constant during the period of the simulation. These mod- 
els produce unrealistic simulations because the measured 
line-of-sight stellar velocity dispersion of the bulge is 
~ 215 fern sec" 1 (Foltz, Hewitt, Webster & Lewis 1992), 
and models of the lensing galaxy suggest values of ~ 
165km sec' 1 (Schmidtt, Webster & Lewis 1998). These 
can be compared with the expected transverse velocity of 
~ 600fcm sec" 1 . In the remainder of this work we make two 
assumptions about the stellar velocity dispersion. Firstly 
we assume that the stellar velocity dispersion is isotropic, 
and secondly that a line of sight velocity dispersion of 
~ 165 km sec' 1 is representative of its value at each of the 
4 image positions. 

Theoretically the histogram of light curve derivatives 
provides a natural way to take account of the effect of 
stellar proper motions in the microlensing model. The ap- 
proach is convenient because rather than having to compute 
many light-curves, each with a slightly evolved starfield (eg. 
Kundic & Wambsganss 1993 and Wambsganss & Kundic 
1995), we only need to compute the light-curve derivative 
at each point. This is because for the determination of the 
average behaviour, the derivatives do not need to be sequen- 
tial along a computed light curve. The formalism for this 
process is developed in Wyithe, Webster & Turner (1999), 
but is described briefly below. 

Analytical expressions were obtained for the rate of 
change of the amplification of a given image. Through 
the contouring method images are found for each source 
point along a source trajectory. The derivatives of each 
image of the source are then added together to give the 
change in magnification: 

^ = f>» ( ,,>^ (5) 

These derivatives facilitate the construction of the cumula- 
tive distribution of light curve derivatives analogous to those 
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TRANSVERSE VELOCITY 

Figure 5. The relationship between the equivalent transverse ve- 
locity and the true transverse velocity (dark line) calculated from 
the difference light curves of models of Q2237+0305 for a point 
source. The light lines are the 1 standard deviation uncertainty 
computed from 3 simulations, and the dotted line shows the line 
of equality. The point masses are distributed with a Salpctcr mass 
function p(m)dm on rn~ 2 - 35 dm and O.IMq < m < I.OMq. The 
optical depth and shear values were those of Q2237+0305, with 
7A, 7s > and -y c , "ID < 0. 



Table 1. Table showing the relationship between the histograms 
of derivatives for light curves of point sources where the flux varia- 
tion is due to proper motion and that where it is due to transverse 
velocity with a static lens. The details are described in the text 
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Velocity 


KS difference 


7A,7B > 0, 7c 


713 < 


300 ± 10 


.010 


7A,7s < 0, 7c 


ID > 


280 ± 10 


.019 



in figure uj. As the same caustics and source are involved, we 
expect the cumulative distributions resulting from the two 
classes of motion to be similar up to a constant scaling factor 
in the derivative. We define an equivalent transverse veloc- 
ity to be that in a static model which produces a cumula- 
tive distribution having the smallest possible KS difference 
between itself and a distribution composed of derivatives 
resulting from a point mass velocity dispersion. 

Table [l] shows the equivalent transverse velocity com- 
puted from the derivatives of the set of 6 difference light 
curves computed in the case of Q2237+0305. The histograms 
are very similar in form with minimised KS differences of 
~ 10" 2 . The errors quoted (and similar errors which follow) 
were calculated as the standard deviation between values 
obtained from three separate sets of simulations. It is im- 
portant to note the large contribution being made to the 
microlensing rate by the velocity dispersion, which is larger 
than that due to a transverse velocity of equal magnitude. 

The derivative at each point on a microlensed light 
curve is the result of fluctuations due to both the trans- 
verse velocity and the changes in the magnification pattern 
with time that result from the changing stellar positions. 



The derivative is additive, and this allows the theoretical 
histogram to be computed for the combination of the two 
effects by adding the derivative due to transverse velocity 
and that due to stellar proper motions at each point. 

An analogous process to that already described allows 
us to find the effective transverse velocity in a static model 
that produces a microlensing rate equivalent to a model 
where proper motions and a transverse velocity are con- 
sidered in combination. Figure |E| shows the relationship be- 
tween the effective transverse velocity and the galactic trans- 
verse velocity in a model of the 6 difference light curves of 
Q2237+0305. The point masses were given a 1-d velocity 
dispersion of 165 fcm sec -1 . In this diagram the dark line 
represents the mean and the light lines the ± 1 standard de- 
viation values. The application of plots such as that shown in 
figure |B| is to subtract off the contribution of proper motions 
from measurements of the effective transverse velocity that 
are made. This process relies on the fact that the distribu- 
tion of fluctuations is independent of whether they are due 
to transverse velocity only, or a transverse motion in combi- 
nation with stellar proper motion. In practice, account also 
needs to be taken of the sampling rate. However the number 
of caustics per unit area of the source plane is unchanged 
through the inclusion of proper motion. The same is true of 
the typical amplitude of fluctuations. Given that the distri- 
bution of the derivatives is also of the same form, we expect 
the sampling rate to have approximately the same effect at 
a given level of fluctuation in the two classes of model. The 
technique of conversion between an effective and a physical 
transverse velocity makes it practical to explore a range of 
microlensing models that include the stellar proper motion 
and compare these to observations. 

6 APPLICATION TO Q2237+0305 
6.1 The Monitoring Data 

The gravitational lens Q2237+0305 has been monitored by 
several groups. Kent & Falco (1988) and Schneider et al. 
(1988) imaged the 4 separate quasar images and produced 
models which accurately reproduced their positions. Later 
monitoring programs were undertaken by the collaborations 
Irwin et al. (1989), Corrigan et al. (1991) and 0stensen et 
al.(1996). By combining this data, a light curve is produced 
for each of the four images. The most complete light curves 
are in R-band, and it is these curves that are analysed here. 
There are a total of 61 published data points for each im- 
age taken between 1985, and 1996. In an effort to minimise 
the noise in the data, any points in the sample which were 
taken within one week of each other were averaged and their 
quoted errors added in quadrature. Following this procedure, 
any points having an associated error above Am = 0.05 
magnitudes were removed from the sample since data points 
with large errors substantially degrade the measurement of 
transverse velocity through introduction of noise into the 
low derivative regime. There were also two data points that 
were discarded since they displayed correlated flux variation 
in two images. This leaves 26 points. Figure ^| displays the 
complete data set (thin line) as well as the set following 
modification with the above procedure (thick line). 

Irwin et al. (1989) estimated the differential random er- 
rors in their data to be Am = ±0.01 — 0.02. Accordingly, 
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Figure 6. The observed light curve of Q2237+0305. The thin line shows the entire data set, while the thick line shows the subset of this 
data which has been used in this analysis. The quoted observational errors are plotted at each point. 
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Figure 7. Left: The six differences between images of the observed light curve of Q2237+0305. Right: The resulting cumulative histogram. 
The quoted observational errors are plotted at each point. 



for this analysis we take the random error in the measured 
magnitude of each image to be Am = ±0.02 magnitudes. 
0stensen et al. (1996) note that the actual statistical qual- 
ity will be better for the stronger A and B components. 
In addition, 0stensen et al. (1996) find that the simulta- 
neous increase in brightness of all four images towards the 
end of the monitoring period has a standard deviation of 
Am — 0.02 indicating that the estimates of error were in- 
deed realistic. The errors have been assigned to the model 
light curves according to a Gaussian distribution. The size 
of the error built into the simulation has an effect on any 
comparison that is made with the data, so we have consid- 
ered two cases. The assigned random error is taken firstly as 
the 2<r level in images A and B, and the la level in images C 
and D, and secondly as the lcr level in images A and B, and 
the i<7 level in images C and D. These cases are expected to 
bracket the uncertainty in the observed image magnitudes. 

From the manipulated data we calculate the variations 
in the 6 image magnitude differences (A-B, A-C, A-D, B-C, 
B-D, C-D). Figure (?] displays the resulting difference light 
curves. Microlensing in this figure is conspicuous as varia- 
tion that is present in three of the six curves. Figure [7] also 
shows the resulting cumulative histogram of derivatives. We 
consider the range of effective transverse velocities obtained 



through comparison of this monitoring data with various 
microlensing models. 

The first thing to note about this interpretation of the 
monitoring data is that there is a significant signal of uncor- 
related fluctuations. The level of noise in the sample can be 
determined by calculating the fluctuations that result from 
the application of the observational uncertainty and sam- 
pling rate to a flat light curve. Figure ^ shows the cumula- 
tive histogram of the rates of change of the difference in the 
image magnitudes of Q2237+0305 together with cumulative 
histograms for the mean, ±lcr and ±2cr values for the cumu- 
lative histogram of derivatives of fluctuations due to observa- 
tional error only. The spread in this distribution results from 
the short sample time scale of the data for Q2237+0305. The 
uncorrelated fluctuation in the low gradient regime is very 
strong evidence for microlensing, particularly when added to 
that from the large independent fluctuations or microlensing 
events that have been observed (Irwin et al. 1989, Corrigan 
et al. 1991). 



6.2 Microlensing Models for Q2237+0305 

To model microlensing in Q2237+0305 we adopt the lens- 
ing galaxy model of Schmidt, Webster & Lewis (1998). This 
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Table 2. The values of average magnification, and the number of stars used in each of the models. Values are shown for each set of 
image parameters and microlens mass model. In the case of the number of stars used, the first value corresponds to the simulations with 
positive shear and the second value to simulations with negative shear. The theoretical magnifications for images A/B, C and D are /i t ^ 
= 4.00, 2.45 and 4.90 respectively. 
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Figure 8. The cumulative histogram of the monitoring data 
(labeled)together with the cumulative histogram for simulations 
where the only fluctuation is due to that from the assigned errors 
(dark line). Also shown are the ±1ct and ±2cr levels (light lines). 
In this case the assigned errors were described by a Gaussian with 
a standard deviation of the quoted observational random error. 



model includes the effect of the bar and produces the follow- 
ing microlensing parameters: image A (k = 0.36, |t*| = 0.40), 
image B (k = 0.36, |t*| = 0.40), image C (k = 0.69, \j\ = 
0.71) and image D (k = 0.59, \j\ = 0.61). We consider 
four models for the distribution of mass in our microlens- 
ing model. These models are as follows: 
Model /: The microlenses are distributed in the range 
O.IM© < m < l.OM© with a Salpeter mass function de- 
scribed by p{m)dm oc m~ 235 dm. 

Model II: The microlenses are distributed in the range 
O.IM© < m < IO.OM0 with a Salpeter mass function de- 
scribed by p(m)dm oc m~ 2 35 dm. 

Model ///: The microlenses are each assigned a mass of 
m = l.OM . 

There is no continuously distributed matter in any of our 
models. None of our models contain low mass (m < O.IM©) 
compact objects. This choice follows the results of Alcock 
et al. (1997) who claim from MACHO experiments that the 
mean mass of dark compact objects in the Milky Way halo 
is between ~ O.2M0 and ~ 0.8Mq. In addition, Schmidt & 
Wambsganss (1998) place limits on the composition of the 
halo in Q0957+561 from the observed lack of microlensing. 
Where the halo is made entirely of compact objects they 
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Figure 9. The magnification distributions for our simulations 
of the images in Q2237+0305. The dark and light lines represent 
simulations that have positive (7 > 0) and negative (7 < 0) shear 
respectively. 



rule out a typical MACHO mass of < 0.001M Q . We note 
here that (as discussed below) a smaller measure of trans- 
verse velocity will result from a model that contains smaller 
compact objects. Our upper limits will therefore be unaf- 
fected by this choice. 

Our microlensing models assume that the mass func- 
tion and range in the bulge is the same at the position of 
each of the four images. The orientation of the bar lies ap- 
proximately along the A-B image axis (Yee 1988). We note 
therefore that this assumption may be rendered invalid if 
the bar population varies from that in the rest of the bulge. 
The images are approximately orthogonal with respect to 
the galactic centre, and so the shear polar in images A and 
B has a direction which is approximately orthogonal to that 
in images C and D. For each microlensing model we con- 
sider two cases, (l) the shear in images A and B is parallel 
with the transverse velocity (described by 7 > 0), and (2) 
in images C and D is parallel with the transverse velocity 
(described by 7 < 0). These bracket the range of possibil- 
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ities for the orientation between the source trajectory and 
the galaxy. 

For each of the four mass models, and for both posi- 
tive (7 > 0) and negative (7 < 0) applied shear, 30 light 
curves were computed for the microlensing parameters cor- 
responding to each image. Each individual light curve had 
a length of 10 Einstein radii. Table ^ shows the numbers 
of point masses required in these models to collect the pre- 
scribed macro-image flux. The values on the left and right 
refer to simulations that have positive shear (7 > 0) and 
negative shear (7 < 0) respectively. Table [| also shows the 
mean magnification found from each of the models together 
with the theoretical average. These values demonstrate that 
our models are accounting for an appropriate percentage of 
the macro image flux. The set of simulations for each sign of 
shear were divided up into 3 subsets (of length 100 Einstein 
radii each). The quoted errors were calculated from a <r n _i 
standard deviation between the resulting 6 values. For com- 
parison with previous publications (eg. Lewis & Irwin 1995) 
figure shows the magnification distributions obtained from 
our models. As required we find that the distributions are 
independent of the sign of the shear. 

Models of the observed light curves were computed us- 
ing a sampling rate identical to that of the set of obser- 
vations. This sampling rate is applied to the various model 
light curves in combination with the simulated observational 
errors. For each of the models considered, an ensemble of 
1500 simulations was produced for statistical comparison 
with the observed light cur ve o f Q2237+0305 through the 



method described in section 2.4 



Figure |10| shows four examples of modelled variations in 
the 6 image magnitude differences, together with the result- 
ing cumulative histograms. The figure also shows the time 
averaged mean calculated from the entire sample (light line). 
In this case the effective transverse velocity in the models 
was 400 km sec -1 . Figure [To] demonstrates that such a short 
monitoring period can mean that the number of large am- 
plitude fluctuations varies wildly depending on the period 
of light curve sampled. 



7 RESULTS 

7.1 Effective Transverse Velocity 

Figure [ll] shows plots of effective transverse velocity vs. per- 
centage of simulations that are more consistent with the 
model time average than the observations in terms of the 
upper limit, lower limit, and mean KS statistics (solid, dot- 
ted, and dot-dashed lines) than the observations. This plot 
measures the most likely as well as the upper and lower 
limits to the effective transverse velocity in the case of a 
type I microlensing model. Similar plots were produced in 
each of the models considered. In each of the three mod- 
els, the effective transverse velocities were found for which 
the cumulative histogram of observations was less consis- 
tent with the time averaged model distribution than 95% 
and 99% of the ensemble of model simulations (V upP er(95%) 
and Vupper (99%)) in the case of the upper limit, and 95% 
of the ensemble simulations in the case of the lower limit 
(Viower(Q5%)). The effective transverse velocity at which the 
histogram of observations was more consistent with the time 




200 400 600 800 1000 

Effective transverse velocity (km/second) 

Figure 11. Plots of the probability of finding a model - average 
model KS difference greater than the observation - average model 
KS difference as a function of effective transverse velocity. The 
solid line and dotted lines correspond to the upper and lower 
limits respectively. The dot-dashed line corresponds to the mean. 
In this case the microlensing model was of type I. The optical 
depth and shear values were those of Q2237+0305, with 7,4, ■yg > 
and 7Ci7d < 0. The observational random error was taken to 
be 2<t for images A and B, and la for images C and D 



average than the greatest number of models was also found 
(Vmean)- These transverse velocities were found in the cases 



of point, O.O2770 (2.6 x 10 1 



and O.IO770 (1.3 x 10 1 



diameter sources. The calculations of the finite source light 
curves were made with the one dimensional approximation 
described in section ^j. Each of the above measurements were 
made using the both assumptions of simulated observational 
error discussed previously. 

Table |3| displays the results obtained for the effective 
transverse velocity (V) computed using the point and O.I770 
diameter sources. The results for Vupper (95%) at all three 
sizes considered are presented in the top panel of Figure 
|l2} The left and right hand plots represent the cases of 
7A,7s > and ja, Jb < 0. In these diagrams the solid, 
dashed and dotted lines represent models I, II, and III 
respectively. The light lines correspond to the quoted obser- 
vational errors being treated as 2a in images A and B, and 
la in images C and D, while the dark lines correspond to 
errors of la in images A and B, and |<r in images C and D. 
From these plots we see that the models yield estimates of 
the effective transverse velocity that are qualitatively con- 
sistent. It is apparent from the plot however that it is the 
model of microlenses chosen that provides the largest un- 
certainty. The ranges of error considered have altered the 
estimated limits on effective transverse velocity by ~ 30%, 
while the source size and the direction of the transverse ve- 
locity provide relatively unimportant contributions to the 
systematic uncertainty in the limits obtained. 

Models with a smaller mean mass, and therefore a larger 
number density of caustics measure a lower effective trans- 
verse velocity. The Einstein radius of a point mass in nor- 
malised units is <Jm. The effective transverse velocities mea- 
sured using two models a and b that have identical macro 
parameters (k, 7), and mass limits whose ratios are equal 
are related by a factor of \J (m a ) / '{mt) ■ It is therefore in- 
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Figure 10. Three typical simulations for a point source with a type / model. A simulated observational random error of lcr for images 
A and B, and ^tr for images C and D is plotted at each point. Left: The six differences between model light curves of the images of 
Q2237+0305. Right: The resulting cumulative histogram (dark line) together with the time average histogram (light line). In this case the 
transverse velocity was 400 km sec -1 . The optical depth and shear values were those of Q2237+0305, with 7Ai7b > and 7Cj7d < 0. 
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Table 3. The effective transverse vefocities (km sec 1 ). The first and second numbers in each coiumn refer to models where 7Aj7b > 0, 
IcrtD < and 7A.7S < 0, 7C.7D > respectively. 
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Figure 12. Top: Plots of the 95% upper limit to the effective 
transverse velocity, and Bottom: Plots of the 95% upper limit 
to the physical transverse velocity as a function of source size 
for model I (solid line), model II (dashed line) and model J/J 
(dotted line). The light lines are calculated using a simulated 
observational error of 2a for images A and B, and la for images 
C and D, while the dark lines correspond to errors twice as large. 
The left hand panels correspond to the case where 7^1 7s > 0,and 
7C , ID < 
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teresting to note the correspondence between this ratio in 
the models we have considered and the effective transverse 
velocities measured. 

In the case of models I and 77: 

\/ {mil} /{mi) = 1.2, V me an(H)/V m ean{I) ~ 1.3. 

In the case of models / and III: 

y/(mni)/{mi) = 2.1, Vmean(III)/V m ean(I) ~ 2. 

The good fit of these values demonstrates that the mea- 
surement of effective transverse velocity is a sensitive func- 
tion only of the mean microlens mass. This result is con- 
sistent with Witt, Kaiser & Refsdal (1993) and with Lewis 
& Irwin (1996) who found that the characteristic time-scale 
for variability scales as (m). In addition, figure [] demon- 
strates that the magnification distribution is independent of 
the mass function and mean microlens mass (Lewis & Irwin 
1995). For these reasons we have not explored models with 
mass functions described by a larger range of power laws or 
mass limits. 

A major unknown in the problem is the direction of the 
source trajectory. Simulations by Witt, Kayser & Refsdal 
(1993) and others show that for image A the rate of HMEs 
is significantly higher for a source moving parallel to the 
shear (7 > 0), similar results are obtained for images C and 
D, although the effect is more pronounced. A combination of 
this effect in each of the 4 images of Q2237+0305 means that 
the shape of the derivative histogram from the difference 
light curves is dependent on the direction of the transverse 
velocity. Figure [13 shows that the measurement of effective 
transverse velocity is slightly smaller where the sign of the 
shear in images A and B is negative. 

While the results are model dependent, some general 
conclusions are readily apparent. Firstly, in all models con- 
sidered the upper limits of effective transverse velocity are 
of the same order as, and the most likely values significantly 
lower than the previously assumed value for the transverse 
velocity of 600 km sec -1 . This suggests that the transverse 
velocity and stellar velocity dispersion have similar magni- 
tudes, and that microlensing by random motion of stars may 
play a significant role in producing the observed continuum 
variation. In addition, the lower limit to the effective trans- 
verse velocity (Vj mi >er(95%)) is greater than zero for all mod- 
els considered. This result is required as evidence that there 
is microlensed continuum variability which is above the level 
generated by observational noise, and is a quantification of 
figure |. 
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7.2 The Transverse Velocity 

The conversion from an effective transverse velocity to a 
physical one was made using the upper limit of relationships 
such as the one shown in figure |5j. This corresponds to the 
subtraction of a lower limit to the contribution of proper 
motions to the microlensing rate. The subtraction of the 
proper motion component is of a similar magnitude for all 
models. The range of velocities is however increased because 
the difference between effective and true transverse veloc- 
ity is less for larger transverse velocities (see figure ^).The 
results are presented in Table ^. The measured values of 
transverse velocity are higher for smaller source sizes and 
smaller simulated errors. Importantly, the most likely trans- 
verse velocity is significantly lower than the expected value 
of 600 km sec" 1 , with values that are zero in most cases con- 
sidered. This zero value does not signify a lack of microlens- 
ing, but rather that there is not an excess of microlensed 
fluctuation above that which is expected from proper mo- 
tions alone. 

The lower plots in figure [Hj show the values of trans- 
verse velocity corresponding to the effective transverse veloc- 
ity plotted in the upper part of the figure. From the results 
of these models we estimate a maximum expected value for 
the galactic transverse velocity. From the results of the 95% 
upper limit we place the expected galactic transverse ve- 
locity at v t < 500 km sec -1 . This upper limit demonstrates 
that the galactic transverse velocity is probably significantly 
lower than the value that has been assumed for analyses of 
microlensing in Q2237+0305 in the past. 



8 CONCLUSION 

Through consideration of the distribution of light curve 
derivatives we find that the ten years of existing moni- 
toring data for Q2237+0305 contains a statistically signifi- 
cant level of uncorrelated and therefore microlensed varia- 
tion in the continuum flux magnitudes of its four images. 
This microlensing is predominantly at a level below that 
which would be considered to be a HME. Through consider- 
ation of the total rate of microlensed variation, and the con- 
tribution to microlensing of stellar proper motion we have 
placed an upper limit on the galactic transverse velocity of 
Vt < 500 km sec -1 . This measurement is not significantly de- 
pendent on the size or intensity profile of the continuum re- 
gion, or on the direction of the source trajectory. The choice 
of microlensing model in combination with the reliability 
of the error estimates produces the greatest uncertainty in 
the problem, however the estimate is qualitatively consistent 
over a range of possible models. This data suggests that the 
transverse velocity of the lensing galaxy of Q2237+0305 is 
likely to be low, in which case the observed microlensing is 
primarily due to the proper motion of microlenses in the 
bulge. 

Analyses such as the one discussed here will provide 
more rigorous limits when data from a longer monitoring 
period becomes available. The statistical significance of a 
measurement of transverse velocity will not be substantially 
aided by a sampling rate smaller than about 1 point per 
month. However, knowledge of the errors involved is impor- 
tant to obtaining a reliable limit. 
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Table 4. The transverse velocities (km sec 1 ) . The first and second numbers in each column refer to models where 7 a , IB > 0, -yc , Jd < 
and 7a, 7s < 0, fCi Id > respectively. 
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